A0-A053  4«»7 
UNCLASSIFieO 


AIR  FORCE  INST  OF  TECH  WFIOHT-PATTERSON  AFB  OHIO  SCH— ETC  F/*  1/4 
THE  55**®*<  OPTIMUM  ALPHANUMERIC  SYMBOL  SET  FOR  COCKPIT  OI--ETC(U) 

DCW  T F t p 0vSH 

AFIT/BC/EE/rr-ll 


MICROCOPY  RESOLUTION  TEST  CHART 

NATIONAL  BUREAU  OF  STANDARDS  196J-A 


AFIT/GE/EE/77-11 


CO 


TirESIS 

AFIT/GE/EE/77-11  Larry  F.  Bush 

Capt  USAF 


I 


Approved  for  public  release;  distribution  unlimited. 


AFIT/GE/EE/77-11 


THE  DESIGN  OF  AN  OPTIMUM  ALPHANUMERIC 
SYMBOL  SET  FOR  COCKPIT  DISPLAYS 


THESIS 

Presented  to  the  Faculty  of  the  School  of  Eng:ineering 
of  the  Air  Force  Institute  of  Technology 
Air  University 

in  Partial  Fulfillment  of  the 
Requirements  for  the  Degree  of 
Master  of  Science 


by 

Larry  F.  Bush,  B.S. 

Capt  USAF 

Graduate  Electrical  Engineering 
December  1977 


Approved  for  public  release;  distribution  unlimited 


o a 


Preface 


The  research  in  this  thesis  investigates  symbol 
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the  feature  domain  and  Euclidian  distance  is  used  as 
a basis  for  the  symbol  change  procedure.  The  pattern 
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I Abstract 

<1 

The  purpose  of  this  study  was  to  design  an  alpha- 
numeric sjnnbol  set  with  improved  legibility  which  could 
be  implemented  as  a standardized  set  for  cockpit  displays. 
Pattern  recognition  theory  was  applied  in  which  the  two- 
dimensional,  discrete,  Fourier  transform  v/as  used  to 
obtain  the  feature  space  for  each  symbol.  Spatial  low- 
pass  filtering  techniques  were  employed  and  a decision 
rule  was  used  based  on  the  maximum  nearest  prototype 
Euclidian  distance  between  symbols.  A symbol  change 
algorithm  was  developed  in  which  symbols  were  moved 
away  from  their  two  closest  neighbors  in  the  transformed 
domain.  Lincoln/Mitre,  a highly  legible  font,  was  used 
as  the  initial  prototype  set.  A new  alphanumeric  symbol 
set  was  designed  with  18  changed  symbols  that  had  an 
overall  greater  distance  spread  between  nearest  neighbor 
symbols.  The  Lincoln/Mitre  and  a new  test  set  were 
psychophysically  tested  with  results  indicating  a 9-5% 
better  performance  for  the  changed  symbols  over  the 
original  Lincoln/Mitre  symbols. 

1 , 
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THE  DESIGN  OF  AN  OPTIMUM  ALPHANUMERIC  SYMBOL  SET 


FOR  COCKPIT  DISPLAYS 

/ 


I . Introduction 


As  aircraft  have'  become  more  complex,  there  has 
been  a concomitant  increase  in  the  number  of  instru- 
ments in  cockpit  displays.  This  is  a consequence  of 
the  increasingly  demanding  performance  requirements  of 
the  aircraft  and  the  weapon  systems  they  contain.  This 
in  turn  results  in  a vast  amount  of  information  that  must 
be  interpreted  correctly  by  the  aircrew  to  insure  a safe 
and  successful  mission.  This  is  a task  which  is  es- 
pecially difficult  under  the  high  stress  conditions  of 
combat.  In  order  to  perform  adequately  all  the  require- 
ments for  a mission,  the  pilot  and  crew  need  flexible 
and  dependable  equipment  which  can  display  information 
without  misinterpretation.  As  a result,  the  light- 
emitted  diode  (LED)  and  the  cathode  ray  tube  (CRT)  are 
beginning  to  replace  the  mechanical  gauge  as  a display 
device . 

LED  devices  have  certain  favorable  characteris- 
tics compared  to  CRT's,  principally  in  the  areas  of  reli- 
ability, maintainability,  compactness,  and  logistic 
cost.  The  Air  Force  Flight  Dynamics  Laboratory  (AFFDL) 
is  presently  funding  research  and  development  of  green 
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LED  matrix  module  display  systems  that  may  lead  to  revo- 
lutionary advancements  in  cockpit  displays  (Ref  19)  ■ 

This  thesis  is  concerned  with  the  design  of  an  op- 
timal alphanumeric  symbol  set  that  can  be  implemented  as 
a standardized  set  for  LED  displays . Current  LED  dis- 
play units  using  a 10  X l4  height-to-width  ratio  for  each 
symbol  are  being  designed  and  tested  by  the  AFFDL.  In 
this  study,  the  10  X l4  matrix  will  be  used  as  the  max- 
imum size  of  each  symbol  in  the  set.  The  matrix  grid  of 
l40  diodes  (10  X l4)  can  contain  either  a charged  diode 
which  will  form  part  of  the  symbol  or  an  uncharged  diode 
representing  the  background.  By  making  slight  changes 
in  the  matrix,  it  is  possible  to  obtain  a very  large 
number  of  perfectly  readible  sets  having  differing  legi- 
bility. 

The  objective  of  this  thesis  is  to  produce  an  alpha- 
numeric symbol  set  that  will  achieve  the  most  uniform 
distribution  of  errors  between  symbols  and  also  decrease 
the  overall  errors  in  identification  when  compared  to 
other  fonts.  A decision  rule  using  the  nearest  prototype 
classifier  based  on  Euclidian  distance  is  used.  A basis 
for  this  study  is  the  discrete,  two-dimensional  Fourier 
transform  that  was  proposed  by  Kabrisky  as  a mathematical 
model  of  the  human  visual  information  processing  system 
(Ref  l6) , and  used  by  Goble  (Ref  11)  to  examine  the  dis- 
tribution of  errors  in  a letter  perception  test. 
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The  Lincoln/Mitre  font,  one  which  indicated  a high 
degree  of  legibility  from  AFFDL  testing,  is  used  as  the 
inputed  prototype  set  from  which  changes  will  be  made  to 
spread  the  distance  between  symbols  in  the  set.  A com- 
puter algorithm  is  used  that  centers  a symbol  in  an  array, 
computes  Fourier  coefficients,  applies  a low-pass  filter, 
performs  symbol  change  procedures,  and  does  other  related 
functions.  Investigations  are  made  into  ways  to  change 
the  dot  matrix  shape  of  the  symbols  that  have  been  iden- 
tified as  the  most  confusing.  Psychophysical  testing 
procedure  is  described  in  Chapter  IV  with  results  dis- 
cussed in  Chapter  V.  A conclusion  and  recommendation  for 
further  study  is  included  in  Chapter  VI . 
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II . Background 


The  human  eye-brain  network  is  a pattern  process- 
ing center  whose  vast  complex  functions  hold  many  clues 
to  the  pattern  recognition  problem.  A diagram  of  the 
visual  system  is  shown  in  Figure  1 below.  The  primary 


parts  of  the  human  visual  system  consists  of  the  retina 
of  the  eye,  the  optic  tract,  the  lateral  geniculate 
body,  the  cerebral  cortex,  and  neural  interconnections. 
The  image  the  eye  receives  is  focused  on  the  back  of  the 
retina  where  the  rods  and  cones  act  as  photo  sensitive 
receptors.  The  rods  differ  from  the  cones  in  that  they 
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are  more  sensitive  to  light  and  contribute  to  vision  at 
illumination  levels  much  lower  than  those  provided  by 
most  display  panels.  The  cones  operate  at  higher  illu- 
mination levels,  provide  better  contrast  sensitivity  and 
spectral  resolution,  and  provide  color  vision.  The  re- 
sponse consequent  to  their  stimulation  is  transfered  to 
a matrix  of  retinal  neurons  and  then  to  large  "ganglion" 
cells  whose  output  axons  form  the  approximately  one 
million  (10^)  channels  in  the  optic  nerve.  In  addition 
to  coding  the  light  pattern,  the  retina  is  a data  re- 
duction system  for  the  processed  image  since  the  10^ 
output  channels  are  derived  from  about  125  million  rods 
and  about  5-5  million  cones.  The  two  nerve  tracks  from 
each  eye  intersect  behind  the  eyes,  connect  to  the  lat- 
eral geniculate  bodies  in  the  brain  stem,  and  then  go  on 
to  the  cerebral  cortex  of  the  brain.  The  cerebral  cor- 
tex is  a sheet-like  structure  in  which  the  flow  of  input- 
output  data  takes  place  perpendicular  to  the  sheet.  The 
coded  visual  data  is  mapped  homomorphically  onto  the 
cortical  sheet;  there  is  a rich  interconnection  between 
neighboring  areas  of  the  cortex  in  which  Kabrisky  had 
theorized  a two  dimension  Fourier  transform  computation 
could  occur  and  characterize  information  transfer  in  the 
visual  data  processing  system  (Ref  l6) . 

There  has  been  much  work  done  on  Fourier  spatial 
frequency  analysis,  as  a model  for  the  recognition  of 
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two-dimensional  images,  by  students  at  the  Air  Force 
Institute  of  Technology  (AFIT) . Radoy  (Ref  20)  invest- 
igated the  use  of  Fourier  transforms  in  which  is  used 
the  low  spatial  frequencies  of  the  transform  to  success- 
fully classify  the  English  alphabet.  Further  work  by 
Tallman  (Ref  24)  verfied  the  use  of  spatial  low-pass 
filtering.  Using  24  handprinted  sets  of  the  English 
alphabet  and  keeping  only  the  first  three  spatial 
frequency  harmonics,  he  was  able  to  properly  classify 
95%  of  a test  set  which  the  machine  had  not  been  ex- 
posed to  previously.  He  properly  classified  99%>  of 
his  training  set. 

Instead  of  a Fourier  transform  a two-dimensional 
Walsh  transform  was  used  by  Carl  (Ref  3)  in  1969*  Using 
Tallman' s data  sets,  he  was  able  to  properly  classify 
95%>  of  the  test  set  within  12  training  iterations.  He 
concluded  that  filtering  can  be  used  to  facilitate 
reliable  machine  classification. 

Further  work  based  on  Kabrisky's  model  have  been 
applied  to  a variety  of  pattern  recognition  techniques. 
Besides  its  use  in  the  classification  of  handprinted 
alphabets  (Ref  24),  it  has  been  used  with  Chinese 
characters  (Ref  l)  , unique  symbol  sets  (Ref  15)  » a^nd 
in  speech  recognition  task  (Ref  6).  Ginsburg  (Ref  10) 
concluded  that  the  model  provided  a means  of  obtaining 
quantitative  psychological  correlates  of  the  human 
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visual  system.  Gins burg  showed  that  there  was  a re- 
lationship between  human  identification  error  and 
Euclidian  distance  when  using  low-pass  filtered  trans- 
forms of  rotated  letters. 

The  results  of  Goble's  dissertation  in  1975 
supported  the  use  of  the  inter-prototype  Euclidian 
dis-tance  matrix  as  a model  for  predicting  human  visual 
perception.  The  distance  matrix  were  produced  from 
a low-pass  two-dimensional  discrete  Fourier  or  Walsh 
transform  of  alphabetic  characters.  Small  Euclidian 
distances  represented  high  degree  of  recognition  errors 
while  the  opposite  was  true  for  large  Euclidian  distances 
(Ref  11) . 

Legibility  research  has  been  seeking  methods  to 
determine  better  legible  symbol  sets  with  observer's 
accuracy  and  speed  in  symbol  identification.  There  is 
a variety  of  adverse  conditions  under  which  operators 
read  displays;  such  as,  fatique,  stress,  clutter, 
vibration,  and  symbol  overprinting.  The  criteria  that 
will  finally  rate  one  set  better  than  another  has  to 
come  from  standardized  conditions  when  the  sets  are 
tested  using  human  subjects. 

A contractor  study  was  completed  in  1975  for  the 
Air  Force  Flight  Dynamics  Laboratory  which  investigated 
symbol  legibility  in  terms  of  behavior  data.  Color, 
contrast,  shape,  percent  active  area,  viewing  angle. 
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symbol  orientation,  symbol  definition,  symbol  subtence, 
vibration,  translation  and  rotation  were  among  the 
areas  which  influenced  human  perception  of  symbols. 

The  principal  conclusions  drawn  from  the  study  were: 

1.  The  parameters  having  the  major  effect  on 
display  legibility  are  percent  active  area, 
luminance,  contrast,  and  symbol  subtense. 

2.  Two  parameters  having  important  inter- 
actions are  symbol  definition  and  symbol 
orientation. 

3.  For  static  upright  symbology,  there  is 
not  a performance  difference  for  a 5 X 7 
matrix  or  greater. 

4.  When  a fixed  matrix  array  is  rotated,  the 
5x7  symbology  was  illegible,  while  the 
8 X 11  and  10  X 21  were  marginal  and 
legible  respectively  (Ref  25544,51). 

Further  discussion  of  symbolic  display  variables  and 
advanced  display  technology  can  be  found  in  Reference  21. 

Over  the  years,  several  alphanumeric  symbol  sets 
have  evolved  that  can  qualify  for  use  in  electronic 
displays.  Included  among  these  fonts  were  the  Lincoln/ 
Mitre,  Huddleston,  Namel,  ASCII,  IBM  029,  and  Leroy. 

See  Appendix  A and  C.  Separate  research  studies  con- 
ducted in  comparing  the  legibility  of  different  fonts 
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resulted  in  selecting  the  Lincoln/Mitre  font  as  a 
highly  legible  one.  Research  is  still  lacking  in 
developing  an  optimal  font  based  on  different  matrix 
size,  shape,  and  arrangements  (Ref  25!63,65). 

Recent  psychophysical  studies  at  the  AFFDL  have 
classified  the  Lincoln/Mitre  font  with  the  least 
human  misidentifications . The  development  of,  the 
Lincoln/Mitre  font  began  with  research  at  Lincoln 
Laboratory  in  Lexington,  Massachusetts  and  continues 
at  the  Mitre  Corporation  in  Bedford,  Massachusetts. 

It  resulted  from  the  work  of  several  researchers  over 
a ten  year  period.  The  Lincoln/lviitre  font  is  well 
suited  to  use  with  degraded  conditions  and  when  high 
speed  and  accuracy  of  identification  are  stressed 
(Ref  22:42,51) . 
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Introduction 

The  procedure  in  this  study  may  be  considered  as  a 
hueristic  approach  in  which  a Fourier  mathematical  model 
and  spatial  filtering  techniques  are  used  to  find  a so- 
lution -^o  the  legibility  problem.  Finding  more  legible 
alphanumeric  symbol  sets  does  not  follow  directly  to 
the  more  formal  pattern  recognition  approaches  to  classi 
fication,  but  uses  pattern  analysis  and  feature  extrac- 
tion to  formulate  decisions.  A nearest  neighbor  criter- 
ion function  is  utilized  as  a means  to  minimize  misclas- 
sification  error  between  symbols.  A development  of  the 
model  to  enhance  class  separability  and  legibility  is 
shown  in  Figure  2 (Ref  17) • 

In  this  process,  each  symbol  will  be  digitized  into 
a pattern  space  so  it  can  be  processed  by  a computer. 

The  discrete,  two-dimensional  Fourier  transform  will 
act  as  a feature  selector.  Low-pass  filtering  will  be 
used  to  reduce  the  feature  space  of  each  symbol  into  a 
49  component  space.  Euclidian  distance  between  each 
transformed  symbol  will  be  used  to  determine  the  most 
confused  symbols  in  each  set.  A symbol  change  algorithm 
will  increase  the  distances  between  symbols  as  a means 
to  increase  separability  between  symbols.  The  inverse 
Fourier  transform  is  performed  and  a spatial  image  is 
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printed  of  the  symbol  being  changed. 

Input  Symbol  Representation 

The  Lincoln/Mitre,  ASCII,  Huddleston,  and  Namel, 
four  fonts  being  tested  by  AFFDL,  were  digitized  on  a 
l4  unit  high  by  10  unit  wide  matrix  grid.  The  IBM  font 
was  also  included  in  this  process.  A Digital  Equipment 
Corporation  PDP-12  computer  and  a flying  spot  scanner 
are  used  to  generate  digitized  symbols.  All  fonts  were 
size  normalized  to  the  10  X l4  grid.  Each  digitized 
symbol  was  punched  onto  computer  cards  by  assigning  an 
intensity  of  1 or  0 to  each  grid  of  the  input  matrix. 

An  example  of  the  Lincoln/Mitre  "A"  is  shown  in  Figure 
3 below. 


Figure  3*  Lincoln/Mitre  "A" 


Each  symbol  is  centered  and  imbeded  in  a zeroed  30 
X 42  window  array  three  times  its  size.  The  reason  for 
this  is  to  reduce  edge  effects,  aliasing  and  leakage 
conditions  that  could  occur  due  to  the  periodic  nature 
of  the  Fourier  transform.  A test  is  made  in  Appendix  F 
to  determine  what  differences  occur  if  symbols  are  used 
that  fill  the  entire  window.  Figure  4 below  shows  the 
30  X 42  input  array  of  the  letter  "A"  before  computing 
its  Fourier  transform. 


1^ 30  


Figure  4.  Window  with  Symbol  Centered 
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Fourier  Analysis 

There  is  a wide  application  of  Fourier  theory  in 
electrical  networks  and  communication  systems.  Recent 
development  of  the  gas  laser  has  also  resulted  in  tech- 
nical advances  in  Fourier  optics.  Fraunhofer  diffrac- 
tion patterns  are  discussed  in  Appendix  D.  In  this 
thdsis  the  Fourier  transform  is  used  as  an  analytical 
tool  in  two-dimensional  image  classification.  Using 
present  computer  technology,  the  Fourier  transform  has 
adapted  itself  well  to  visual  scene  analysis. 

The  two-dimensional  discrete  Fourier  transform  is 
used  to  convert  each  symbol  into  its  spatial  frequency 
components.  The  inputted  symbol  represents  the  pattern 
space  whose  matrix  size  is  in  units  of  length.  By  means 
of  the  transform,  the  feature  space  represents  the  spec- 
trum or  spatial  frequencies  of  the  symbol.  The  Fourier 
transform  may  be  considered  as  performing  a spectral 
resolution  of  the  pattern  in  which  the  corresponding 
phase  and  amplitude  elements  are  obtained  in  a complex 
valued  array  (Ref  24:20,21). 

The  discrete,  two-dimensional  Fourier  transform 
for  a M X N array  X is: 

N M r 

n = T.  L ^m  n -j2n  (mp/M)  + (nq/)' 

n = l m=l  ’ L 

where  X^^  is  a discrete  point  in  the  input  pattern 
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space  and  ^ is  the  complex  spatial  frequency  term  in 
the  feature  space. 

The  Fourier  transform  is  an  orthogonal  transform 
that  preserves  distance  and  angle  between  vectors. 

This  transform  changes  the  original  array  X into  a set 
of  Fourier  coefficients  that  represent  a system  of  sine 
and  cosine  functions  which  contain  all  the  information 
of  the  original  symbol.  By  using  Euler's  identity; 

= COS0  + jsin9  (2) 

equation  (l)  can  be  written  as: 


r*  • •••  *1“  - j 


sin 


2nmv 

M 


-j=in  ^ 


(3) 


The  inverse  Fourier  transform  is; 


N-1  M-1  r 1 

\,n  = MN  q5o  p?o  + (nq/N)J  (4) 

By  adding  the  correct  Fourier  components  in  the  spatial 
frequency  domain,  the  exact  input  pattern  can  be  repro- 
duced. Any  visual  pattern  can  be  reconstructed  from  a 
finite  sum  of  spatial  sinusoids  of  appropriate  spatial 
frequency , orientation,  and  amplitude.  Like  the  Laplace 
transform,  the  Fourier  transform  is  a linear  operator 
with  specific  properties.  Explanation  and  proof  of  the 
Fourier  properties  are  found  in  Reference  12  and  24. 
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Difcital  Computation 

A Cooley-Tukey  fast  Fourier  transform  (Ref  5)  is 
used  in  a subroutine  called  FOURT  on  the  Control  Data 
Corporation  (CDC)  6600  computer  to  calculate  each  sym- 
bols Fourier  components.  The  zero  frequency  or  DC  term 
in  the  feature  space  (Fourier  space)  is  located  in  the 
upper  left  corner  of  the  real  array.  The  DC  term  has  a 
zero  imaginary  component.  It  is  easier  to  locate  the 
harmonics  of  the  transformed  frequencies  by  rearranging 
the  FOURT  matrix.  Remember  also  that  the  imaginary  part 
of  each  component  is  arranged  in  the  same  format.  If  the 
even  30  X 42  matrix  (real  and  imaginary)  is  divided  into 
four  equal  quadrants,  the  rearranged  format  is  created 
by  switching  the  first  and  fourth,  and  the  second  and 
third  quadrants  as  shown  in  Figure 

Now  the  DC  term  will  be  located  by  the  equation: 

DC  (p,q)  = (M/2  + 1),  (N/2  + 1)  (5) 

The  value  of  the  DC  term  represents  the  total  number 
of  I’s  in  the  inputted  symbol.  The  first  harmonic  is  the 
first  ring  of  components  surrounding  the  DC  term  and  so 
forth  for  higher  harmonics  as  shown  in  Figure  6.  This 
method  is  used  to  identify  the  features  that  will  be 
selected  after  filtering. 

Spatial  Filtering 

The  number  of  features  is  reduced  by  spatial  fil- 
tering. The  application  of  the  low  pass  filtering  is 
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Figure  6.  1st,  2nd,  3rd,  and  7th  Harmonic  of  Window 

based  on  the  heuristic  approach  that  has  appeared  re- 
liable in  symbol  classification  research  in  this  area 
(Ref  3,4,20,24).  One  value  of  filtering  is  that  it 
reduces  computer  computation  time.  The  low  spatial 
frequencies  contain  the  majority  of  the  energy  or  infor- 
mation of  the  original  symbol.  The  fine  detail  or  edge 
information  is  found  in  the  higher  spatial  frequencies 
(Ref  7«309).  A low-pass,  3rd  harmonic  filter  of  the 
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Figure  7»  9th  Harmonic  of  Window 

symbol  was  used.  A 3rd  harmonic  filter  of  the  symbol 
corresponds  to  the  9th  harmonic  of  the  window  as  shown 
in  Figure  7 above.  Every  3rd  component  in  the  first  9 
rings  around  the  DC  term  was  kept  as  shown  in  the  re- 
arranged matrix  and  the  FOURT  Fourier  component  config- 
uration in  Figure  8.  When  the  inverse  transform  is 
applied  to  this  filtered  array,  9 repeats  of  the  symbol 
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occur  as  shown  in  Figure  9*  This  is  due  to  the  harmonic 
relationship  hetween  the  window  and  the  actual  symbol. 
This  is  corrected  to  produce  only  one  output  prototype 
(Fig  10),  by  repacking  the  filtered  components  (Fig  11). 
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Figure  9*  Reproduced  Symbol  with  9 Repeats 
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Figure  10. Reproduced  Symbol  Using  Repacked  Filter 
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Due  to  the  symmetry  property  of  the  Fourier  trans- 
form, half  of  the  features  are  identical.  Figure  12 
shows  the  real  rearranged  array  with  the  Fourier  compo- 
nents that  were  obtained  from  the  Lincoln/lviitre  (L/M) 
"P."  The  DC  term  equals  64,  the  number  of  I's  in  the 
symbol  "P."  The  line  of  symmetry  separates  the  array 
into  two  identical  parts,  and  is  the  same  for  the  imag- 
inary transformed  array.  The  number  of  features  needed 
can  be  reduced  in  half  in  which  only  the  top  25  real  and 
top  24  imaginary  components  are  saved  as  shown  in  Figure 
13.  The  imaginary  part  of  the  DC  term  is  always  zero. 
All  other  components  are  set  to  zero  in  the  filter  rou- 
tine . 

The  DC  feature  is  a measure  of  the  average  energy 
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Figure  13 . Fourier  Real  Array  with  25  Components 

of  the  symbol.  Since  each  matrix  dot  is  of  the  same 
intensity  and  different  symbols  contain  varied  numbers 
of  charged  LED's,  there  is  different  intensity  levels 
between  symbols.  The  DC  term  was  set  to  zero  in  the 
filter  due  to  the  insignificance  of  intensity  in  symbol 
identification  in  the  human  visual  system  (Ref  13i317) 
and  (Ref  14j47) . 
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Campbell  and  Robson  (Ref  2 : 551-55^  measured  the 
contrast  thresholds  of  a variety  of  grating  patterns 


over  a wide  range  of  spatial  frequeneies.  They  obtained 
curves  for  2°  and  for  10°  fields  of  view  with  viewing 
distances  of  285  cm  and  57  cm  as  shown  in  Figure  l4. 


Figure  l4.  Contrast  sensitivity  for  sine-wave  gratings. 

Luminance  500  cd/m^.  Viewing  distance  285 
cm  and  aperture  2“ x 2°  ,A;  viewing  distance 
57  cm,  aperture  10°  x 10°  viewing  dis- 
tance 57  cm,  aperture  2”x  2°,0*  (From  Ref 
2j554) 


The  low  spatial  frequencies  show  small  contrast  sensi- 
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tivity  with  the  valve  increasing  to  a peak  at  approx- 
imately 3 cycles/degree . This  is  shown  in  Figure  3 
where  the  zero  cycles/degree  (DC)  has  insignificant 
contrast.  Gagnon  (Ref  9)  shows  similar  results  using 
the  modulation  transfer  function  (MTF) . The  human  MTF 
normally  shows  the  spatial  frequency  response  of  the 
eye  to  have  a peak  sensitive  from  5 cycles/degree  to 
12  cycles/degree,  depending  on  the  brightness,  and 
lower  frequencies  attenuated  for  monochromatic,  foveal 
stimuli . 

Energy  Normalization 

The  filtered  array  X is  energy  normalized  by  the 
equation : 


’ 30  42 

X.  . = X.  . 

1 1 J 1 1 J 

i=l  j=l 

*1.0 

2 

where  3(i,j  is  the  normalized  ijth  element.  Each  symbol 
is  made  up  of  the  49  features.  The  49  Fourier  components 
can  define  a vector  or  point  in  49  space.  When  normal- 
ized, the  point  representing  each  symbol  will  fall  on 
a hypersphere  of  radius  1.  The  distance  between  symbols 
will  be  chords  on  this  sphere.  The  maximum  distance 
being  two.  After  normalization,  the  total  energy  in 
X is  equal  to  unity. 

A normalized  function  is  identical  to  the  same 
function  with  a gain  constant  multiple.  This  is  shov,Ti 


27 


below  where  the  Fourier  transform  of  the  function  is: 


and  after  is  energy  normalized  it  is  represented  by: 


Now  if  a^  has  a gain,  k where  b^  = ka^,  the  Fourier 
transform  is: 

ka  kA  = B 

n P P 

and  after  it  is  energy  normalized: 


V 

N = 1 ^ 


/N  ; 

z 

fN  = l ^ 


Equation  (7)  and  (9)  are  the  same.  This  shows  how 
normalization  can  equate  symbols  with  gain  differences. 
This  could  be  caused  by  a varied  intensity  of  the  charged 
light-emitted  diodes. 

Energy  normalization  is  performed  after  filtering 
to  insure  that  the  points  for  each  prototype  fall  on 
the  unit  hypersphere.  Energy  normalization  prior  to 
filtering  will  cause  the  points  to  move  off  the  hyper- 
sphere once  filtering  takes  place. 

Euclj  dian  Distance 

The  49  energy  normalized  Fourier  components  can 
now  represent  features  of  each  symbol.  Each  36  symbol 
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alphanumeric  font  can  be  plotted  as  36  points  on  a 
hypersphere.  Similar  symbols  may  cluster  together  on 
the  hypersphere.  It  is  these  symbols  that  are  close 
together  in  the  Euclidian  distance  sense  that  are 
likely  to  be  more  confused  than  those  that  are  far 
apart.  The  Euclidian  distance  between  each  vector 
or  'symbol  on  the  hypersphere  is  calculated . The 
symbols  and  their  distances  can  be  compared  to  their 
human  error  data.  If  A and  B are  the  position  vectors 
of  two  symbols  in  the  feature  space  and  the  distance 
between  A and  B is : 

Dist^g  = |a-b|  (10) 

and  the  Euclidian  distance  for  49  component  vectors  is 


Dist 


AB 


(11) 


where  A^  and  B^  are  the  i^^  component  of  each  vector. 

A distance  matrix  between  all  symbols  in  a set  is 
printed  out.  Table  I and  Table  II  are  the  Euclidian  dis- 
tance matrix  for  the  Lincoln/Mitre  font.  The  value  on 
one  side  of  the  diagonal  of  the  matrix  are  identical 
to  those  on  the  other  side.  This  is  due  to  the  fact 
that  the  distance  from  "A"  to  "B"  is  the  same  as  from 
"B"  to  "A".  The  diagonal  elements  are  zero  since  the 
distance  from  each  symbol  to  itself  is  zero.  The 
statistical  mean  and  the  standard  deviation  are  also 
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Table  I 

Lincoln/Mitre  Distance  Matrix 
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calculated.  They  are  printed  below  the  distance 
matrix . 

All  the  distances  are  then  sorted  from  smallest 
to  largest.  Table  III  shows  the  sorted  A-L  for  the 
Lincoln/Mitre  set;  Table  IV  shws  M-X , while  Table  V 
shows  Y-9. 

- A histogram  of  the  Euclidian  distance  data  for  the 
Lincoln/Mitre  font  is  shown  in  Figure  15*  The  distance 
range  between  0 and  2 was  divided  into  kO  bins  or  .05 
units  for  each  subdivision  on  the. distance  axis.  For 
example,  bin  20  represents  distances  greater  than  .95 
and  less  than  or  equal  to  1.00.  Note  for  a symbol 
font  there  are  always  36  values  in  bin  1 due  to  zero 
distance  for  like  symbols . The  distance  histogram 
shows  a gaussian  distribution  of  the  data. 

Symbol  Rating 

Using  Euclidian  distance  as  a decision  rule,  each 
font  can  be  rank  ordered  by  the  distance  between  each 
symbol  and  its  nearest -neighbor . The  symbol  pair  with 
the  shortest  distance  is  ranked  first  in  being  a con- 
fusing symbol.  All  pairs  are  ranked  until  the  last 
pair  is  the  one  with  the  greatest  distance  between 
prototype  and  nearest  neighbor  as  shown  in  Table  VI. 

Now  the  column  is  checked  for  each  pair  in  order  to 
rank  the  two  symbols  in  the  pair.  A rank  order  of  the 
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Table  III 

Lincoln/Mitre  Sorted  Distance  Matrix 
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Table  IV 

Lincoln/Mitre  Sorted  Distance  Matrix 
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Table  V 

Lincoln/Mitre  Sorted  Distance  Matrix 
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Table  VI. 


Symbol  Pair  Confusability  Data 


PAIR 

DIST 

PAIR 

DIST 

PAIR 

DIST 

PF 

.6251 

NH 

.8933 

83 

1.0355 

FP 

.6251 

ZE 

.9008 

XY 

1.0532 

EF 

.6255 

MH 

.9084 

GO 

1.0767 

RP 

.6730 

7T 

.9382 

J6 

1.0828 

T1 

.6779 

QR 

.9740 

6j 

1.0828 

IT 

.6779 

BD 

.9751 

1.1021 

OU 

.7098 

DB 

.9751 

CE 

1.1337 

UTO 

.7098 

Y7 

.9887 

SG 

1.1676 

LE 

.8224 

VW 

1.0146 

97 

1.1737 

5E 

.8462 

WV 

1.0146 

4G 

1.1785 

IT 

.8680 

KR 

1.0146 

28 

1.1794 

HN 

.8933 

38 

1.0355 

A6 

1.2806 

Table 

VII. 

Symbol  Confusability  Rating 

1 

- F 

10-5 

19  - D 

28  - 

J 

2 

- P 

11  - I 

20  - Y 

29  - 

6 

3 

- E 

12  - H 

21  - W 

30  - 

0 

4 

- R 

13  - N 

22  - V 

31  - 

C 

5 

- T 

l4  - Z 

23  - K 

32  - 

s 

6 

- 1 

15  - M 

24-8 

33  - 

9 ■ 

7 

- U 

16-7 

25-3 

34  - 

4 

8 

- 0 

17  - Q 

26  - X 

35  - 

2 

9 

- L 

18  - B 

27  - G 

36  - 

A 

Lincoln/Mitre 

set  is  shown 

in  Table  VII 

above 

. "F"  is 

the  most  confused  symbol. 

Using  the  psychophysical  data  compiled  from  human 
testing  of  the  Lincoln/Mitre  font,  the  set  can  be  rank 
ordered  according  to  the  number  of  errors  in  identifi- 
cation. Table  VIII  shows  in  descending  order  the  number 
of  errors  for  each  symbol  and  Figure  1 6 is  a plot  of 
this  data.  The  confusion  m.atrix  obtained  from  the  psy- 
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Table  VIII 

Lincoln/Mitre  Psychophysical  Data 
Total  Errors  by  Rank  for  6 Subjects 


Rank 

Symbol 

Total  Errors 

1 

P 

130 

2 

0 

99 

3 

1 

88 

4 

I 

84 

5 

F 

82 

6 

U 

75 

7 

S 

72 

8 

L 

72 

9 

5 

71 

10 

G 

59 

11 

E 

58 

12 

6 

57 

B 

55 

14 

3 

50 

15 

J 

48 

16 

T j 

47 

17 

8 

46 

18 

H 

35 

19 

4 

35 

20 

N 

32 

21 

Y 

30 

22 

R 

29 

23 

2 

29 

24 

Z 

28 

25 

D 

26 

26 

W 

I 25 

27 

M 

24 

28 

24 

29 

V 

23 

30 

X 

21 

31 

Q 

19 

32 

K 

17 

33 

C 

17 

34 

9 

13 

35 

7 

12 

36 

A 

7 

38 


Figure  17.  Confusion  Matrix  for  Lincoln/Mitre  Set 


chophysical  data  of  6 subjects  tested  on  the  Lincoln/ 

Mitre  set  is  shown  in  Figure  17. 

Another  method  of  rating  symbols  compares  the  aver- 
age distance  from  a symbol  to  all  the  others.  The  distances 
from  a symbol  to  its  other  35  neighbors  are  summed,  and 
the  mean  distance  is  found  for  each  symbol.  The  results 
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of  this  method  is  shovm  in  Table  IX. 

Symbol  Change  Procedure 

The  objective  of  this  study  was  to  produce  an  alpha- 
numeric symbol  set  that  is  more  human  recognizable  and 
with  less  errors  between  symbols.  Since  the  Lincoln/ 
Mitre  was  a highly  legible  set,  it  is  chosen  as  the  pro- 
totype set  in  which  improvements  are  made.  The  Lincoln/ 
Mitre  font  is  shown  in  Appendix  A.  A nearest  neighbor 
decision  rule  is  used  to  decrease  the  possibility  of 
a neighbor  being  classified  as  the  prototype  and  to  make 
symbols  more  separable.  This  is  performed  by  identifying 
symbol  pairs  with  small  Euclidian  distances  and  moving 
them  further  apart . 

The  Euclidian  distance  rule  can  also  be  expressed 
as  an  equivalent  correlation  rule.  The  prototype  and 
nearest  neighbor  (smallest  distance)  compare  to  maximum 
''relation . By  decreasing  the  correlation,  this  would 
be  another  means  similar  to  increased  distance  movement 
that  would  make  symbols  less  confusing. 

Comparing  the  human  error  data.  Table  VIII  to  the 
distance  rating  in  Table  VII  for  the  Lincoln/Mitre  set, 
there  is  a good  deal  of  agreement  between  human  errors 
and  small  distances  of  separation.  Eight  out  of  the 
first  twelve  symbols  identified  by  the  Lincoln/Mitre 
distance  rating  were  among  the  first  twelve  found  using 


human  data. 
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Table  IX 
Lincoln/Mitre 

Ordered  Mean  Distance  to  Neighbors 


Rank 

Symbol 

Mean  Distance 

1 

E 

1.266 

2 

P 

1.280 

3 

F 

1.282 

4 

5 

1.294 

5 

R 

1.306 

6 

8 

1.306 

7 

0 

I.3O8 

8 

U 

1.321 

9 

L 

1.324 

10 

Z 

1.324 

11 

H 

1.328 

12 

3 

1.361 

13 

I 

1 . 364 

l4 

G 

1.367 

15 

2 

1.368 

16 

Q 

1.375 

17 

7 

1.376 

18 

B 

1.378 

19 

N 

1.378 

20 

J 

1.380 

21 

C 

1.382 

22 

S 

1.384 

23 

K 

1.387 

24 

T 

1.388 

25 

W 

1.395 

26 

V 

1.396 

27 

Y 

1.399 

28 

X 

1.403 

29 

M 

1 .406 

30 

9 

1.406 

31 

6 

1.4i6 

32 

1 

1.428 

33 

4 

1.442 

34 

D 

1.454 

35 

0 

1.464 

36 

A 

1.502 
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Symbol  Chanpie  Al/rorithm . A symbol  change  algorithm 
was  designed  in  which  the  most  confusing  symbols  using 
human  data  results  for  the  Lincoln/Mitre  set  were  to  be 
changed.  Most  of  the  errors  in  identification  resulted 
from  the  two  nearest  neighbors.  Using  this  fact,  the 
algorithm  was  designed  to  move  a confusing  symbol  away 
from  its  two  nearest  neighbors.  A flowchart  represent- 
ing the  change  algorithm  is  shown  in  Figure  l8. 

The  midpoint  of  the  two  nearest  neighbors  is  found 
by  the  equation: 


D.  = 
1 


Ai  . B. 


(12) 


where  is  the  ith  component  of  the  midpoint.  The 
symbol  to  be  changed  is  moved  away  from  this  midpoint 
by  a percentage  of  the  distance  from  the  midpoint  and 
itself.  The  percentage  of  movement  is  called  weight 
(WT)  in  the  equation; 


C = (C  , , - D. )WT  + C , , 

new^  old^  i'  old^ 


(13) 


where  represents  the  new  symbol  after  a movement. 

The  change  equation  can  be  shown  in  vector  notation. 
Figure  19  shows  D as  the  midpoint  and  A and  B as  the 
two  nearest  neighbors.  Letting  WT  = l/lO  and  equal 
the  new  point  X after  the  first  movement,  the  vector  V 
equals  l/lO  TO,  or  V = l/lO(X  - D) . Therefore  X^  = 

X + l/lO(X  - D)  or  X^  = ll/lO  X - l/lO  D. 
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Calculate  new 
ordered 
distance 
matrix 


Increase 

weip:ht 

(wt) 


changes? 


"rTearost  ? neighborh’^-w  No 
^ changed? 


Figure  18.  Symbol  Change  Algorithm  Flow  Chart 


Figure  19-  Vector  Representation  of  Change  Equation 


Printing  the  Changed  Symbol . Each  time  a new  chang- 
ed symbol  position  is  found,  it  is  again  energy  normal- 
ized to  bring  it  back  to  the  hypersphere.  The  inverse 
Fourier  transform  is  performed  and  the  changed  symbol 
is  printed  out  from  a subroutine  that  assigns  a gray 
level  to  the  inverse  Fourier  components.  The  minimum 
and  maximum  inverse  Fourier  com.ponents  are  found.  Gray 
level  values  ranging  between  1 and  20  are  then  assigned 
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by  the  equation: 


(X  - Min) 
gray  level  = 1 + 19  Max Min" 

where  X is  the  inverse  Fourier  component  of  the  m,n 
m ,n  ^ 

point  in  the  JO  X 42  matrix. 

The  30  X 42  matrix  of  gray  level  values  is  then 
divided  evenly  into  3^3  squares.  These  squares  will 
form  a 10  X l4  matrix,  the  same  size  as  the  original 
inputted  symbol.  A threshold  equal  to  90  is  establish- 
ed in  which  the  sum  of  each  square  (containing  nine  com- 
ponents) is  tested.  If  this  sum  is  greater  or  equal  to 
the  threshold,  the  corresponding  square  is  assigned  a 
one  (1).  If  it  is  less,  it  is  assigned  a zero.  In  this 
way,  a new  10  X l4  symbol  is  produced. 

By  an  overprinting  procedure,  a printout  of  the 
symbol  is  obtained.  Due  to  filtering,  the  identical 
symbol  is  not  reproduced,  but  the  symbol  is  still  rec- 
ognizable. By  moving  the  symbol  further  away  from  its 
two  closest  neighbors,  its  Fourier  components  are  chang- 
ed. This  inturn  causes  a new  symbol  printout  each  time 
a new  move  is. made.  In  the  algorithm^  each  symbol  was 
moved  ten  times  and  printed  after  each  move.  The  two 
nearest  neighbors  were  checked  for  changes  in  other  sym- . 
bols  coming  closer  after  the  move.  If  a new  neighbor 
was  closer  after  the  move,  the  symbol  change  equation 
would  move  the  last  location  of  the  prototype  being 
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changed  away  from  the  new  midpoint  of  these  new  neigh- 
bors . An  example  using  the  letter  "P"  and  its  closest 
neighbors  "F"  and  "R"  is  shown  in  Figure  20  on  the  next 
page.  After  the  10th  movement,  the  curved  part  of  the 
"P"  is  highlighted  and  indicates  that  more  information 
should  be  placed  in  the  loop  of  the  "P."  The  final 
(lO  X l4)  "P"  that  was  obtained  is  shown  compared  with 
the  original  in  Figure  21 , 


Lincoln/Mitre 

Final  Set 

11111111  — 

11111111-- 

111111111- 

11-11111- 

11 111 

11 1111 

11 11 

11 1111 

11 11 

11 1111 

11 111 

11 1111 

111111111- 

11--11111- 

11111111  — 

11111111  — 

11 

11 

11 

11 

11 

11 

11 

11 

11 

11 

11 

11 

Figure  21 . "P" : Lincoln/Mitre  and  Final  Set 


The  symbol  change  procedure  was  also  applied  to  a 
transformed  symbol  without  filtering.  All  the  Fourier 
components  of  the  symbol  were  used  in  the  change  al- 
gorithm. The  results  of  printing  the  symbol  "P"  fr6m 
its  inverse  Fourier  coefficients  after  5 movements  is 
shown  in  Figure  22.  Both  the  filtered  and  non-filtered 
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Fi^oire  20.  Ten  Movements  of  Lincoln/Mitre  "P"  from  two  closest  neiehhors 


yyxy>fyyyvxyYyxyy'<yyyyxx>:xxxyxy 

yyyy^yYvyyyyvyyyyyYyyyyvyyyyyy 

yYyY<yyyyyyyyYyvyyy<yyyyyyxyvy 

yxyY'yyYYyxyyyYYyyyYyYxvyyyxy> 

yyYXiyyvyyyyvyyyvYyyyyyyyvyYvy 


XXyy<yyy<XXXXYYyxyxyyyYYyyYxxy 

yyyyvYYYYyyyX''yYyyY'<yyYyxyyxyy 

xyyy<yyyyy''yvvyYYvyYyv<vyYxxyy 

yyXy<yYXYyxxyxvyyxyyxYxyxyyyYy 

yyyy<YxyxyyxxyyyyyYXxyY;<xxyyyy 

xxxxvxyyyxxyYyYXYYyyxxxyxvyyyv 

yyyyyYyyyxxyyyYYyyyyyxyYxyyxxx 
xxx X XYYxyy xxxYXYxyyxYYYYxyxy y y 


xxxxxxxyxyyyxxxxxxyxxxxxxYxxyY 
XXXX  <xxxxy!|?!a9Sf3a9  xxxyxyyyyy 


xxxxxyxYxxQ^aaaagaa  xYxyxyyyyy 

XyXX<yyX<Xeia'<XXvyBggxxxxxyyxxy 

xxy  X <yyxxxa9xxyyxy33x yyx yyyyyy 

xxxxxYXXxxaaxyxxyx^'avxxxxyxyyy 

XXXX<YXVXyS9yyXYY3f,"3XyYYyYYxyy 

xxxxxx YYXxeaaaaagae  yyxYyyxyyy 
xxxx  < vyYxx  aaaaaatja  XYXXxvxyxx 
xxxxxYyxxx99y“--yxvxxxYxxyxxxY 
xxxx^xxxxxgayy — yyyxxxxxYYXxy 
xxxx<yxxxxeaxxx — xxxxxxxxxxyy 
xxxx^yxYYxaaxxxx — xxyyyyyyxyy 
xxyyxyyxxyaaxxxYX — xxxxxyxyxx 
yxxy<Yyxyx8ayyyyxy--yxyxxyxxxy 
yxxxyyyyxxxxyyxxyyxxxyxyxxxyxy 
xxxxxyxxxyxyxyxyyyxyxxxyxyxxxy 
xxx  y <yyyxxxyxy yxyyy xxyxyyyyx  yy 
xxxY <xyyxxxxxyy yyyyyy yyyyyyyyy 
yxxxxyxyxxxyxxxxxxyyyyxxxyyxyy 
xxxx  <xxxxxxxyxxyyxyyxyxyxyxxxy 
xxxx<xxyyxxyyxxyxxy'''xxxxxyxxxy 
xxxxxxxvxxxxyxxyyyvxxyyyxyxxxy 
xxxx<yxxxxxyxxxxyyxxxy<xxyxxx> 
xxxx<yxx<xxx<yxxxxxyxxxyxyxxxy 
yxxx<xxx<xxxxxxyxxyyxy'<yxyxxxy 

XXyyyyyyXYXyyyYyyyyyxyxxxyyyyy 

xxxxxyxxxxxxxyxyxyxxxxxxxyxxxy 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 


Figure  22.  Lincoln/Mitre  "P"  after  5 Movements 
with  no  Filter 


procedure  demonstrated  similarities  in  identifying  which 
"P"  pixel  elements  are  stressed  the'  most  after  the 
same  amount  of  change . 

After  each  movement  of  the  symbol  in  the  feature 
space,  there  is  a continuous  increase  in  the  distance 
between  the  prototype  and  its  tv;o  nearest  neighbors. 
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Figure  23-  "P"  Movement  from  Nearest  Neighbors 


Table  X shows  the  first  6 nearest  neighbors  of  the  Lin- 
coln/Mitre "P"  after  10  movements.  Figure  23  shows  how 
the  two  nearest  neighbors  "F."  and  "R"  moved  after  each 
change  of  the  Lincoln/Mitre  "P."  Most  of  the  distance 
spread  for  the  first  two  neighbors  occured  in  the  first 
five  moves.  On  the  fifth  move  both  the  "F"  and  "R"  were 
replaced  by  other  symbols  as  the  nearest  neighbors.  For 
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Table  X 

Lincoln/Mitre  "P"  after  10  Movements 


Movem.ent 

1st 

Neighbor  with  Distance 

2nd  3rd  4th  5th 

6th 

1 

F 

R 

E 

H 

0 

Z 

.6515 

.7010 

.9455 

.9620 

1.1218 

1.1235 

2 

F 

R 

H 

E 

0 

Z 

.7097 

.7616 

.9816 

.9879 

1.1162 

1.1352 

3 

F 

R 

H 

E 

0 

8 

.8116 

.8659 

1.0215 

1.0620 

1.1124 

1.1562 

4 

F 

R 

H 

0 

8 

E 

.9693 

1.0241 

1.0949 

1.1197 

1.1554 

1.1774 

5 

0 

8 

B 

F 

H 

R 

1.1528 

1.1747 

1.1753 

1.1801 

1.2095 

1.2323 

6 

B 

F 

0 

8 

H 

9 

1.1924 

1.1946 

1.2003 

1.2220 

1.2235 

1.2358 

7 

0 

B 

F 

H 

8 

9 

1.2115 

1.2228 

1.2250 

1.2374 

1 . 2442 

1.2445 

8 

F 

H 

9 

0 

M 

R 

1.2285 

1.2427 

1.2458 

1.2510 

1.2563 

1.2568 

9 

9 

B 

2 

0 

V 

7 

1.2382 

1.2557 

1.2676 

1.2682 

1.2695 

1.2810 

10 

M 

0 

V 

2 

F 

9 

1.2545 

1.2672 

1.2731 

1.2774 

1.2806 

1.2813 

weight  equal  to  one-tenth,  there  was  enough  information 
after  10  movements  to  determine  which  features  will  make 
the  prototype  more  recognizable  from  its  two  nearest 
neighbors.  It  must  be  remembered,  that  changing  a sym- 
bol may  infact  cause  it  to  come  closer  to  other  symbols 
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in  the  set  and  therefore  negate  the  results  of  the 
original  change.  Once  a new  symbol  set  is  found, 
its  ordered  Euclidian  distance  matrix  is  calculated 
and  compared  to  the  original  set.  All  symbols  in  the 
set  are  checked  to  verify  how  much  of  an  increase  or 
decrease  in  distance  between  each  symbol  and  its  two 
nearest  neighbors.  Symbol  pairs  with  decreased  dis- 
tances were  then  reconsidered  for  further  change.  It 
is  noted  that  slight  changes  in  a symbol  could  cause 
the  nearest  neighbors  to  change  and  also  reduce  dis- 
tances in  some  cases. 

This  particular  procedure  was  repeated  for  the 
most  confusing  of  the  Lincoln/Mitre  symbols.  A new 
symbol  set  was  found  based  on  information  obtained 
from  the  changed  symbol  printout.  Basic  criteria  was 
always  to  try  to  increase  the  original  Lincoln/Mitre 
symbol  distances.  A total  of  11  different  sets  were 
run  with  the  final  set  11  showing  the  best  increase 
in  distance  between  prototype  and  nearest  neighbors. 
Set  11  is  shown  in  Appendix  B.  A total  of  l8  symbols 
were  changed  from  the  original  Lincoln/Mitre  font . 
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IV.  Testing: 

Psychophysical  testing  of  the  newly  generated  font, 
refered  to  as  set  11,  was  conducted  in  conjunction  with 
the  Multimode  Matrix  Display  Office  of  the  Air  Force 
Flight  Dynamics  Laboratory.  The  objective  of  the  test- 
ing was  to  determine  the  legibility  of  set  11  compared 
to  the  Lincoln/Mitre  font.  A font's  legibility  is  de- 
termined by  the  total  errors  made  in  identification  under 
similar  viewing  conditions. 

Sub j ects 

A total  of  two  male  personnel  served  as  subjects. 
Each  subject  had  20/20  or  corrected  20/20  vision,  and 
none  reported  any  evidence  of  visual  abnormalities.  One 
of  the  subjects  had  participated  in  a previous  experiment 
dealing  with  font  legibility. 

Stimuli 

The  test  stimuli  consisted  of  the  letters  A-Z  and 
numbers  0-9  in  both  the  Lincoln/Mitre  and  in  set  11. 

The  stimuli  were  photographed  on  35mm  slides  in  the  form 
of  transparent  10  X l4  dot  matrix  arr'ays  on  an  opaque 
high-density  black  background.  There  was  a total  of  80 
slides  in  each  slide  tray  which  included  two  sets  of  each 
font  and  8 additional  symbols. 

One  dot  was  centered  on  the  viewing  screen  to  act 
as  a fixation  point  for  the  subject  and  to  indicate  that 
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another  slide  was  in  place.  Prior  and  following  each 
test  stimulus,  a 19  msec  stimulus  mask  was  flashed  on 
the  screen.  The  mask  was  a bright  light  of  homogenous 
illumination  that  filled  the  complete  stimulus  viewing 
area  of  the  10  X l4  array.  Masking  of  a test  stimulus 
by  a flash  of  light  is  termed  blanking  (Ref  13;9l).  The 
mask  and  stimulus  time  were  the  same  for  each  subject. 
Figure  24  shows  the  trial  sequence. 


19  msec  15  msec  19  msec 

I J J 


Fixation  Mask  Stimulus  Mask  Fixation 

Point 


Figure  24.  Psychophysical  Testing  Sequence 


Masking  techniques  are  used  to  force  errors  to 
occur  by  deteriorating  the  viewing  conditions.  With 
continued  practice  a subject's  performance  increases. 
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If  all  fonts  achieve  a 100^  level,  then  there  is  no 
disparity  between  them  to  judge  their  legibility.  This 
high  performance  level  occured  when  a visual  noise  mask 
of  a random  array  of  white  dots  on  a dark  background  was 
used.  For  this  reason,  the  light  masking  was  used  to 
obtain  a 7O-8O  percent  starting  error  rate.  Masking 
interrupts  the  signal  to  the  brain  and  changes  the 
signal  to  noise  ratio  of  the  stimulus  by  increasing 
the  noise  component.  The  mask  that  follows  each  stimulus 
helps  to  decrease  the  effects  of  the  after  images 
(Ref  13s9^) . 

Apparatus 

The  testing  facility  consisted  of  a room  for  the 
subject  and  a room  for  the  experimenter  and  testing 
equipment.  A 12.7  cm  X 15.2  cm  viewing  screen  was  built 
into  the  connecting  wall  between  the  rooms.  The  subjects 
were  seated  approximately  one  meter  in  front  of  the  screen. 
Subjects  wore  a two-way  intercom  headphone  to  enable 
verbal  communication  with  the  experimenter.  A subject 
would  initiate  each  experimental  trial  by  pressing  the 
microswitch  which  would  activate  the  equipment. 

All  stimuli  were  rear-projected  onto  the  viewing 
screen.  The  stimuli  exposure  time  was  operated  by  an 
electronic  shutter  controlled  by  a Iconix  projection 
tachistoscopic  system.  A Kodak  Carousel  projector  with 
rauidom  access  was  used  to  project  the  test  stimuli. 


Procedure 


Before  the  first  testing  session,  each  subject  was 
briefed  on  the  task  involved.  They  v/ere  told  to  press 
the  microswitch  in  order  for  the  mask-stimulus  to  be 
flashed  on  the  screen.  They  would  respond  into  the 
microphone  with  the  symbol  they  identified,  and  guess 
if  ihey  were  undecided.  Before  each  testing  period,  the 
subjects  were  permitted  to  study  the  fonts  to  insure 
familiarity  of  stimuli  symbols.  Each  font  of  36  charac- 
ters was  contained  on  two  sheets  of  paper  21.5  cm  X 35*5 
cm.  Each  symbol  was  in  10  X 1^  dot-matrix  configuration 
with  2.5  cm  X 3.5  cm  in  the  X-Y  direction. 

After  each  subject  was  familiar  with  the  test  stim- 
uli; the  subject  was  seated  in  the  darkened  room.  He 
was  given  several  minutes  for  his  eyes  to  adjust  to  the 
room.  For  each  test  period,  each  subject  would  start 
with  a different  font  and  with  both  the  Lincoln/Mitre 
and  set  11  font  being  tested.  In  between  trials  the 
experimenter  would  reposition  the  slide  tray  at  random 
positions.  The  subject  was  given  a practice  run  consist- 
ing of  160  slides  on  the  first  font  to  be  tested.  The 
second  font  tested  normally  would  show  better  results 
due  to  the  learning  received  from  the  first  font.  By 
each  subject  testing  the  fonts  in  different  orders,  a 
compensation  for  the  learning  effects  was  achieved. 

The  experimental  conditions  were  carefully  controlled 
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to  insure  extraneous  factors  would  not  be  introduced 
into  the  study. 

A total  of  36  test  trials  were  conducted  over  a 3 
week  period  of  12  testing  days.  Each  subject  was  tested 
3 times  during  each  testing  day.  Over  10,300  symbols 
were  tested  by  both  subjects  over  the  testing  period. 

' Accurate  measurements  were  taken  to  specify  pre- 
cisely all  the  test  viewing  conditions.  The  average 
intensity  of  each  dot  was  5 footlamberts  (fL)  and  it 
varied  for  each  symbol.  For  a given  viewing  distance 
d,  the  visual  angle  0,  subtended  by  a symbol  of  size  h 
can  be  calculated  using  the  equation: 

0 = 2tan“^  (h/2d)  (15) 

For  a viewing  distance  of  1 meter  and  a symbol  size 
on  the  screen  of  6.5  cm  high  by  1.0  to  4.6  cm  wide, 
results  in  a viewing  angle  of  3.72°  vertically  and 
0.57°  to  2.64°  horizontally.  The  percent  active  area 
is  given  by  the  equation: 

emitter  size 

percent  active  area  = 2“  ^ (I6) 

(emitter  spacing)'^ 

Emitter  spacing  is  the  distance  between  the  centers  of 
adjacent  dots,  while  emitter. size  is  the  area  of  each 
dot.  The  approximate  diameter  of  each  dot  was  0.4  cm. 
The  spacing  between  dots  was  0.5  cm  with  resulting 
active  area  equal  to  50^. 
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V.  Results 


Computer  Algorithm  Results 

The  symh  Is  of  the  final  set  11  that  were  changed 
from  the  original  Lincoln/Mitre  set  are  indicated  by  a 
underline  in  Appendix  B.  A total  of  18  symbols  were 
changed.  The  histogram  for  the  final  set  11  inter- 
prototype Euclidian  distances  is  shown  in  Figure  25. 

The  Lincoln/Mitre  symbols  with  the  smaller  distance 
separation,  v/hich  is  represented  by  the  left  side  of 
the  histogram,  were  spread  further  apart.  By  comparing 
the  histogram  of  the  Lincoln/Mitre  set  to  that  of  the 
final  set,  the  distance  shifting  can  be  verified.  The 
minimum  distance  for  the  final  set  is  .8307  which  is  the 
distance  between  "P"  and  "F."  The  original  distance  for 
the  worst  pair  of  the  Lincoln/Mitre  set  was  .6251,  also 
for  "P"  and  "F."  The  mean  distance  between  all  symbols 
also  increased  from  1.3706  to  1.3755  in  the  final  set. 
The  standard  deviation  was  .1537  in  the  final  set  and 
.1766  in  the  Lincoln/Mitre. 

Taking  a closer  look  at  the  distance  spread,  the 
symbol  change  objective  based  on  Euclidian  distance  was 
achieved  in  moving  the  closest  symbols  in  order  to  in- 
crease the  distance  between  them  and  the  other  symbols 
in  the  set.  A comparison  of  the  number  of  symbols  in 
each  distance  increment  and  its  associated  bin  number 
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is  shovm  in  Table  XI. 

In  the  first  20  bins,  which  includes  distances  up 
to  and  including  1.00,  the  final  set  had  a total  of  l4 
symbols  whose  nearest  neighbors  were  within  this  dis- 
tance range;  this  compares  to  46  symbols  in  the  original 
Lincoln/Mitre  font. 

Table  XII  shows  each  symbol  and  the  distance  to  its 
nearest  neighbor  for  both  the  Lincoln/Mitre  and  final 
set.  In  the  final  set,  33  symbols  showed  an  increase 
in  the  distance  to  their  nearest  neighbor  from  the  orig- 
inal set.  The  10  symbols  that  showed  the  most  change 
were  the  E , 0 ,U ,L, R ,T , 1 , 5 .P .F  in  that  order.  These  ten 
symbols,  but  not  in  the  exact  order,  were  the  same  sym- 
bols in  the  first  ten  ranking  by  symbol  confusability . 
Psychophysical  Testing  Results 

The  psychophysical  results  indicated  that  the  test 
was  more  difficult  in  the  early  trials.  Near  the  end 
of  the  test  a ceiling  effect  was  being  approached  and 
performance  was  nearly  85^.  The  final  test  set  11  show- 
ed a significant  advantage  over  the  Lincoln/Mitre  set 
in  the  early  part  of  the  test  as  is  shown  in  the  cu- 
mulative percent  correct  by  trial  in  Figure  26.  Under 
higher  stress  conditions,  the  final  set  seemed  to  per- 
form better  and  as  the  test  got  easier,  both  sets  per- 
formance asymptotically  approached  the  same  level.  A 
learning  effect  was  associated  with  the  test  and  showed 
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Table  XI 


Bin  Number  Data  for  Lincoln/Mitre  and  Final  Set 


Bin 

Numbers 

Range 

Lincoln/Mitre 

Symbols 

Final  Set 
Symbols 

1 

0.0  - .05 

0 

0 

2 

.05  - .10 

0 

0 

3 

.10  - .15 

0 

0 

4 

.15  - .20 

0 

0 

5 

.20  - .25 

0 

0 

6 

.25  - .30 

0 

0 

7 

.30  - .35 

0 

0 

8 

.35  - .40 

0 

0 

9 

.40  - .45 

0 

0 

10 

.45  - .50 

0 

0 

11 

.50  - .55 

0 

0 

12 

.55  - .60 

0 

0 

13 

.60  - .65 

4 

0 

l4 

.65  - .70 

4 

0 

15 

.70  - .75 

2 

0 

16 

.75  - .80 

0 

0 

17 

.80  - .85 

4 

2 

18 

.85  - .90 

6 

0 

19 

.90  - .95 

14 

8 

20 

.95  - 1.0 

12 

4 

21 

1.0  - 1.05 

20 

18 

22 

1.05  - 1.10 

28 

46 

23 

1.10  - 1.15 

46 

38 

24 

1.15  - 1.20 

62 

. 54 

25 

1.20  - 1.25 

70 

76 

26 

1.25  - 1.30 

- 92 

116 

27 

1.30  - 1.35 

120 

128 

28 

1.35  - 1.40 

148 

174 

29 

1.40  - 1.45 

174 

178 

30 

1.45  - 1.50 

148 

138 

31 

1.50  - 1.55 

l40 

124 

32 

1.55  - 1.60 

94 

82 

33 

1.60  - 1.65 

52 

60 

34 

1.65  - 1.70 

12. 

8 

35 

1.70  - 1.75 

4 

4 

36 

1.75  - 1.80 

4 

2 

37 

1.80  - 1.85 

0 

0 

38 

1.85  - 1.90 

0 

0 

39 

1.90  - 1.95 

0 

0 

4o 

1.95  - 2.00 

0 

0 
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Table  XII 

Lincoln/Mitre  and  Final  Set 
Distance  to  Nearest  Neighbor 


Symbol 


Nearest  Neighbor 
Tiincoln/i'litre  Final  Set 

Difference 

1.2806 

1.2806 

0 

.9761 

1.0539 

.0778 

1.1337 

1.1266 

-.0071 

.9761 

1.0698 

.0937 

.6255 

.9614 

.3359 

.6261 

.8307 

.2046 

1.0767 

1.0698 

- .0069 

.8933 

.9082 

.0149 

.8530 

1.0248 

.1718 

1.0828 

1.0828 

0 

1.0305 

1.0963 

.0613 

.8224 

1.0963 

.2739 

.9084 

1.0119 

.1035 

.8933 

.9082 

.0149 

.7098 

1.0303 

.3205 

.6251 

.8307 

.2056 

.9470 

.9735 

.0265 

.6730 

.9093 

.2363 

1.1676 

1.1676 

0 

.6779 

. .9040 

.2261 

.7098 

1.0303 

.3205 

1.0146 

1.0193 

.004? 

1.0146 

1.0193 

.0047 

1.0532 

1.0532 

0 

.9887 

1.0142 

.0255 

.9008 

1.0248 

.1240 

1.1021 

1.1366 

.0345 

.6779 

.9040 

.2261 

1.1794 

1.1794 

0 

1.0355 

1.0595 

.0240 

1.1785 

1.1785 

0 

.8462 

1.0595 

.2133 

1.0828 

1.0828 

0 

.9382 

1.0288 

.0906 

1.0355 

1.0539 

.0184 

1.1737 

1.1075 

-.0662 
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a decrease  in  the  percentage  of  errors  as  the  test  con- 
tinued. 

Of  the  two  subjects  tested,  one  had  been  previously 
tested  on  the  Lincoln/Mitre  font  over  a 7 month  period. 

As  expected,  he  performed  approximately  k-fo  better  on 
the  Lincoln/Mitre  font.  The  other  subject,  without  any 
prior  learning  of  the  Lincoln/Mitre  font,  performed 
approximately  better  on  the  final  set  11. 

Since  only  18  symbols  were  differenx  in  the  two 
sets,  a comparison  of  how  well  the  changed  symbols  per- 
formed between  fonts  would  be  interesting  information. 
Table  XIII  shows  the  number  of  errors  in  both  sets  for 
the  changed  symbols  only.  The  changed  symbols  iden- 
tification results  for  both  subjects  was  6^%  correct  in 
the  final  set  and  only  57  >5^°  in  the  Lincoln/Mitre  font. 
The  test  favored  the  new  changed  symbols  by  9.5%  with 
an  overall  increase  in  distance  to  their  nearest  neigh- 
bors. For  symbols  that  were  identical  in  both  sets, 
the  Lincoln/Mitre  set  was  favored  by  lZ.l%o  What  is  the 
disparity  in  such  a case  when  symbols  are  the  same  but  in 
different  sets?  One  reason  could  be  imperfections  in 
the  dot  size  and  arrangement  of  dots  on  the  slide  from 
one  font  to  the  other.  This  would  cause  variations  in 
light  intensity  and  contrast  between  the  symbols.  Dif- 
ferent error  rates  for  identical  symbols  may  mean  that 
recognition  performance  depends  on  the  learning  effect 
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Table  XIII 

Changed  Symbols  Total  Errors  for  Both  Fonts 


Symbol 

Identification 

Lincoln/Mitre 

Errors 

Final  Set 

C 

51 

23 

D 

81 

86 

E 

54 

56 

H 

58 

31 

I 

75 

25 

K 

90 

47 

L 

35 

4l 

N 

4o 

37 

0 

92 

53 

P 

89 

48 

R 

72 

60 

T 

46 

49 

U 

83 

69 

W 

17 

23 

z 

44 

39 

3 

37 

50 

5 

70 

29 

7 

58 

85 

Total 

1102 

856  . 

for  the  entire  font. 

Some  changed  symbols  like  the  "T"  and  the  "L"  had 
increased  thickness  or  stroke  width  with  slightly  higher 
error  rates.  Although  the  symbol  change  algorithm  had 
emphasized  the  features  that  were  thickened,  the  in- 
creased thickness  seemed  to  increase  legibility  of  the 
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symbol  up  to  a maximum  value;  wider  strokes  then  tended 
to  cause  more  errors  to  occur  in  identification. 

The  psychophysical  results  should  only  be  consid- 
ered as  a pilot  study  in  the  determination  of  the  leg- 
ibility between  the  Lincoln/Mitre  font  and  the  set  11 
font.  More  conclusive  results  can  be  made  with  further 
investigation  and  testing  using  more  subjects  over  a 
longer  test  period. 
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VI.  Conclusions  and  Recommendations 


Conclusions 

The  results  in  Chapter  V indicated  that  the  legibil- 
ity of  the  Lincoln/Mitre  alphanumeric  symbol  set  was 
improved  with  a new  font  which  demonstrated  better  per- 
formance when  the  identification  task  was  most  difficult. 
Also  important  is  the  fact  that  the  l8  changed  symbols 
in  the  new  set  11,  which  had  an  overall  greater  Euclidian 
distance  between  prototype  and  nearest  neighbors,  were 
less  confusing  than  the  Lincoln/Mitre  set.  These  results 
support  the  hypothesis  that  an  inter-prototype  Euclidian 
distance  matrix,  produced  from  a low-pass  filtered,  two- 
dimensional  discrete  Fourier  transform,  may  be  of  merit 
as  a human  visual  perception  model.  The  larger  the  inter 
prototype  distance  spread,  the  lower  the  recognition 
errors  and  more  dissimilar  the  symbols. 

The  symbol  change  algorithm  was  designed  to  move 
a prototype  away  from  its  two  closest  neighbors  and 
indicate  which  Fourier  component  features  were  the  most 
significant  after  symbol  changes.  Using  this  information 
the  changed  symbol’s  spatial  image  was  used  to  digitize 
a new  symbol  with  greater  distance  seperation  from  it  to 
its  nearest  two  neighbors. 

This  thesis  is  only  one  approach  to  finding  more 
legible  alphanumeric  symbol  sets . The  symbol  change 
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algorithm  effectively  decreases  the  discriminability 
error  between  symbols  based  on  increased  Euclidian 
distance  measures.  Table  XIV  summarizes  the  entire 
procedures  of  the  algorithm  used  to  arrive  at  new 
symbols . 

Recommendations 

It  is  recommended  for  further  study  that  the  out- 
lined procedures  of  Table  XIV  be  used  to  change  every  . I 

Lincoln/Mitre  symbol.  Design  a new  legible  alphanu- 

1 

meric  set  with  a greater  distance  spread  between  symbols. 

Use  the  same  procedure  to  produce  other  symbol  sets  not 
including  alphanumerics . 

It  is  suggested  that  the  final  set  11  be  tested 
using  a new  test  paradigm  in  which  the  error  rate  is 
standardized  for  each  subject  during  each  trial  by  apply- 
ing different  masking  levels.  Also  it  is  desired  to 
increase  the  number  of  subjects  to  10  and  increase  the 
number  of  trials  to  50.  Reaction  time  may  be  another 
testing  parameter  besides  subject  errors. 

The  present  process  of  photographing  each  symbol 
onto  a slide  has  a tendency  to  cause  random  variations 
in  the  quality  of  each  slide  and  in  the  shape  of  each 
symbol.  This  in  turn  could  cause  identification  errors 
that  have  not  been  accounted  for.  The  implementation 
of  a computer  controlled  system  rather  than  the  tach- 
istoscope  system  would  enhance  not  only  the  accuracy 
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Table  XIV 
Symbol  Algorithm 

1.  Digitalize  each  symbol  by  sampling  the  intensity 
pattern  of  the  symbol  by  use  of  the  flying  spot  scanner 
or  other  practical  method. 

2.  Center  symbol  in  zero  filled  matrix  array. 

3.  Using  FOURT  subroutine,  transform  each  digitally 
represented  symbol  into  its  Fourier  spatial  components. 

4.  Extract  features  by  low-pass  filtering  each  trans- 
formed symbol. 

5.  Energy  normalize  each  transformed  symbol  to  place 
on  the  hypersphere. 

6.  Compute  the  Euclidian  distance  between  each  filter- 
ed transformed  pair  and  print  the  Euclidian  distance 
matrix  for  the  complete  set. 

7.  Order  distances  and  print  out  ordered  Euclidian 
distance  matrix  for  complete  set. 

8.  Use  symbol  change  procedure  to  move  prototype  away 
from  two  closest  neighbors. 

9.  Take  inverse  Fourier  transform  of  changed  symbol 
and  obtain  its  spatial  image. 

10.  Using  the  changed  symbol’s  spatial  image,  digitalize 
a new  symbol. 

11.  Repeat  steps  2-7  to  verify  increased  Euclidian 
distance  matrix  for  changed  prototypes. 
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but  also  increase  the  availability  of  meaningful  results. 

Finding  other  methods  in  which  the  changed  symbol’s 
spatial  image  can  be  intei-rupted  to  give  exact  pixel 
elements  for  the  new  symbol  is  another  area  where  re- 
search is  needed. 

There  are  a variety  of  other  factors  besides  symbol 
shape  that  effect  legibility  that  can  be  explored. 
Shurtleff  (Ref  23)  has  experimented  with  many  legibil- 
ity factors  to  include  symbol  brightness,  contrast,  il- 
lumination, exposure  time,  spacing,  size,  stroke  width, 
and  viewing  area.  Other  factors  such  as  noise,  blur, 
vibration,  and  repetition  rate  effect  symbol  legibility. 

Appendix  E also  suggest  further  study  in  using  the 
symbols  of  legible  sets  to  form  suboptimal  sets  based 
on  Euclidian  distance.  These  sets  can  then  be  analyzed 
for  "optimum"  conditions. 
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ASCII,  Namel,  Huddleston,  and  IBM  Sets 
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Appendix  D 


Fourier  Transform  in  Fraunhofer  Diffraction 

Diffraction  is  the  deviation  of  light  from  its 
straight-line  propagation  resulting  from  the  inter- 
action with  the  edges  of  obstacles  it  encounters  in 
its  path.  Diffraction  can  be  classified  as  either 
Fraunhofer  or  as  Fresnel;  the  type  depends  on  the 
distances  of  the  light  source  and  the  viewing  screen 
from  the  diffracting  screen.  Fraunhofer  diffraction 
is  said  to  occur  when  the  diffraction  screen  is  far 
from  the  light  source  and  viewing  screen,  and  there 
is  only  a small  fraction  of  the  wavelength  between 
lines  drawn  from  the  viewing  point  or  source  to  all 
points  of  the  aperture  (Ref  26:24,25)* 

Fraunhofer  diffraction  displays  a Fourier  trans- 
form pattern  on  the  far  field  from  a diffracting  aper 
ture . The  Fourier  transform  is  given  by: 

jjc 

'/'(x,y)  = //AC  ^,T))e^  (fx  + T?y)dfdi7 
where  describes  the  amplitude  and  phase  of  the 

field  across  the  aperture  and  '/'(x,y)  discribes  the 
field  in  the  focal  plane. 

Consider  a rectangular  aperture  centered  on  the 
axis  of  the  f and  n plane . Let  the  width  equal  to  2a 


(17) 
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b 

-a 

a 

-b 

Figure  27 • Rectangular  Aperture 


and  the  length  equal  to  2b  as  shown  in  Figure  2?  above. 
Then  the  Fourier  transform  of  that  occurs  in  the 

observation  plane  will  be: 

ik 

'l»(x,y)  = Xb  A(f,i7)e^  (^x  + Vy)d^d-n  (18) 

and  after  the  integration  is  performed  the  following  is 
obtained; 

</'(x,y)  = 4Aab  sinc(-^)  sinc(^^^)  (19) 

Figure  28  shows  a cross  section  of  the  Fraunhofer  dif- 
fraction pattern  along  the  X axis  (Ref  18:4,5). 

Using  monochromatic  light  from  a laser  source  and 
the  setup  shown  in  Figure  29  on  the  next  page,  the 
Fraunhofer  diffraction  pattern  for  a wire  mesh  aperture 
was  photographed  on  the  back  focal  plane  of  the  imaging 
lens.  The  pattern  is  shown  in  Figure  30.  The  wire  mesh 
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Figure  28.  Cross  Section  of  Fraunhofer  Dif- 
fraction Pattern  along  X Axis 
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Figure  30.  Fraunhofer  Diffraction  Pattern  of 


Wire  Mesh 

image  is  restructured  from  the  various  Fourier  compon- 
ents and  is  photographed  at  the  image  plane  in  Figure 
31. 


Figure  31 . Restructured  Wire  Mesh  Image 
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Figure  32.  Horizontal  Image  Using  Vertical  Slit 


Spatial  filtering  techniques  can  he  demonstrated 
hy  placing  a narrow  vertical  slit  in  the  focal  plane. 

This  slit  will  pass  only  a single  row  of  Fourier  com- 
ponents. The  corresponding  image  contains  only  the  hor- 
izontal structure  of  the  wire  mesh  as  shown  in  Figure 
32.  The  opposite  is  true  for  the  horizontal  slit  filter. 

The  number  "4"  was  photographed  onto  a slide  and 
was  used  in  place  of  the  wire  mesh.  The  Fraunhofer 
diffraction  pattern  of  the  "4"  was  obtained.  An  adjust- 
able iris  was  used  to  eliminate  all  the  spatial  frequen- 
cies greater  than  the  first  harmonic  and  the  filtered 
diffraction  pattern  was  photographed  in  Figure  33-  The 
reproduced  image  was  a blurry  "4"  which  was  double  ex- 
posed with  the  diffraction  plane  and  is  also  shown  in 
Figure  33*  The  high  frequency  components  that  yield 
edge  information  are  filtered  out.  This  preserves  only 
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the  basic  form  of  the  original  image. 


Figure  33-  Number  "4"  with  Spatial  Filter 
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Appendix  E 


Optimization  of  Legible  Sets  Using  Distance  Measure 

For  N alphanumeric  symbol  sets,  each  set  consisting 
of  36  prototypes,  A - Z and  0 - 9»  the  most  "optimal" 
set'  is  defined  as  the  combination  36  symbol  alphanu- 
meric set  with  the  maximum  Euclidian  distance  between 
every  prototype.  To  manually  find  this  set  given  a 36N 
by  36N  Euclidian  distance  matrix  is  normally  infeasible 
due  to  the  large  combinations  of  possible  symbol  sets; 
therefore,  a method  is  desired  that  will  limit  the  num- 
ber of  investigations  and  at  the  same  time  find  candidates 
consisting  of  suboptimal  sets. 

The  first  procedure  described  developes  a new  sym- 
bol set  from  prototypes  of  several  legible  fonts.  Any 
candidate  set,  in  which  the  distance  between  any  two 
symbols  is  less  than  an  established  minimum  distance,  is 
excluded  from  consideration.  The  objective  is  to  find 
combination  sets  that  will  have  an  increased  overall 
Euclidian  distance  from  each  symbol  to  its  nearest  neigh- 
bor. 

In  this  process,  a computer  algorithm  is  used,  in 
which  all  symbols  have  been  spatially  filtered  and  their 
corresponding  Fourier  coefficients  locate  each  symbol  in 
a higher  order  space.  An  Euclidian  distance  matrix  was 
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then  calculated  between  all  prototypes. 

A 36N  by  36N  matrix  of  Euclidian  distance  values 
would  take  up  a large  amount  of  computer  space.  Since 
the  distance  data  is  symmetrical  about  the  diagonal,  this 
information  is  utilized  to  only  store  half  of  the  values. 
An  array  is  generated  which  contains  the  distance  and 
a pointer  to  its  location  in  the  original  matrix.  This 
array  of  distances  with  location  symbol  pairs  is  then 
sorted  from  largest  to  smallest. 

Once  the  sorting  is  complete,  the  search  begins  to 
find  the  first  complete  set  with  the  largest  distance 
between  symbols.  No  distance  is  accepted  smaller  than 
the  minimum  distance  which  is  called  Zmin.  A counter  I 
is  used  and  initilized  to  one.  The  following  steps  are 
performed: 

1.  Starting  at  the  Ith  point  in  the  sorted  array, 
the  symbol  pair  at  that  position  becomes  the  first 
two  symbols  for  the  suboptimal  set  called  array  lA . 
If  the  distance  of  the  pair  at  I is  not  greater  than 
Zmin,  the  process  will  stop. 

2.  The  next  lower  pair  of  symbols  in  the  sorted 
array  is  checked.  If  the  distance  for  this  pair  is 
less  than  Zmin,  then  a complete  set  cannot  be  found. 
If  one  of  the  symbols  is  already  in  array  lA,  the 
procedure  will  be  continued  but  that  symbol  will 
not  be  stored  in  array  lA. 
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3.  Another  check  is  made  to  confirm  that  a symbol 
class  already  placed  in  array  lA  is  removed  from 
further  consideration.  For  example,  if  a "R"  of 
one  set  had  been  choosen,  then  no  more  "R"s  are 
needed.  In  this  case,  the  procedure  woxild  proceed 
to  the  next  pair  of  the  sorted  array. 

4.  Before  storing  a new  symbol  in  array  lA,  the 
distance  between  it  and  all  other  symbols  in  array 
lA  are  checked.  If  all  distances  are  greater  than 
Zmin,  that  symbol  is  accepted  into  the  suboptimal 
set.  If  the  distance  is  less  than  Zmin,  the  pro- 
cedure starts  over  at  step  2 again. 

5.  Once  a complete  suboptimal  set  is  found,  it  is 
printed  along  with  its  distance  matrix.  The  counter 
I is  increased  by  one  either  when  a complete  set  is 
found  or  when  after  all  possible  checks,  a complete 
set  cannot  be  formed.  The  process  will  then  start 
over  at  step  1 . 

The  final  suboptimal  sets  obtained  will  have  a min- 
imum interprototype  distance  equal  or  greater  than  the 
Zmin  value.  Test  can  then  be  performed  on  these  sub- 
optimal sets  to  determine  which  ones  are  optimal  under 
different  conditions.  Different  statistical  methods  can 
be  applied  to  discriminate  between  sets. 

Another  procedure  suggested  uses  a symbol  set  mean 
distance  between  all  prototypes  as  a measurement  criterion. 
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Given  N alphanumeric  symbol  sets,  the  one  with  the  larg- 
est mean  distance  is  used  as  the  model  set.  Starting 
with  the  first  symbol,  the  "A"  is  exchanged  for  an  "A" 
of  another  set  and  a new  mean  distance  is  calculated. 

In  turn,  after  all  the  "A"s  of  all  the  sets  are  tested, 
the  "A"  that  gives  the  highest  overall  mean  is  saved  in 
the  model  set.  The  procedure  then  continues  for  the  "B"s 
and  so  on  until  all  symbols  have  been  tested  and  a final 
new  symbol  set  obtained. 

The  above  procedures  are  only  attempts  at  finding 
"optimal"  sets  using  distance  measures.  The  actual 
choice  of  "optimal"  sets  depends  upon  the  peculiar 
nature  of  each  problem  and  the  manner  in  which  its  al- 
gorithm is  implemented. 


Appendix  F 

Window  Effect  on  the  Distance  Matrix 

Prior  to  taking  the  Fourier  transform  of  a symbol, 
each  symbol  was  centered  in  a zero  filled  window  that 
was  three  times  its  size.  The  window  was  used  to  elim- 
inate or  reduce  edge  effects,  aliasing  and  leakage  con- 
ditions that  could  result  from  the  periodic  nature  of 
the  Fourier  transform.  What  effect  does  the  window 
have  on  the  Euclidian  distance  matrix?  This  question 
was  answered  by  comparing  the  distance  matrix  for  a sym- 
bol set  using  10  X l4  symbols  centered  in  a window  to 
one  obtained  for  a symbol  set  whose  symbols  occupied 
the  entire  30  X 42  window  space.  Fourier  analysis  and 
low-pass  filtering  were  performed  on  both  sets. 

The  ASCII  symbol  set  shown  in  Appendix  C was  used. 
Each  symbol  in  the  set  was  also  enlarged  three  times 
its  size  by  an  algorithm  that  assigned  nine  "l"s  for 
each  digitalized  "1"  of  the  original  10  X l4  symbol. 

In  this  way  a new  30  X 42  ASCII  symbol  set  was  develop- 
ed. The  bigger  symbols  had  more  of  a staircase  effect 
due  to  the  scaling  problem  that  is  created  by  the  dig- 
italizing process.  This  in  turn  can  induce  some  error 
into  the  distance  matrix  since  it  is  not  an  identical 
enlargement  of  the  original  symbol  but  a digital  repre- 
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sentation . 


The  ordered  Euclidian  distance  matrix  was  calcu- 
lated for  the  ASCII  set  with  and  without  a window.  Part 
of  the  ordered  distance  matrix  for  both  sets,  A - L,  is 
shown  in  Table  XV  and  in  Table  XVI.  Checking  the  first 
two  nearest  neighbors  for  all  symbols,  only  the  letter 
"E’'"  showed  a difference  and  this  was  in  the  2nd  neigh- 
bor. The  window  effect  was  insignificant  in  changing 
the  position  of  the  nearest  neighbors  and  showed  only 
slight  deviations  in  positions  for  the  entire  set. 
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Table  XV 

ASCII  10  X l4  - Ordered  Distance  Matrix  A-L 
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Table  XVI 

ASCII  30  X 42  - Ordered  Distance  Matrix  A-L 
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